Naming the Past: Named Entity and Animacy Recognition in 19th Century Swedish Literature

Authors

  • Lars Borin
  • Dimitrios Kokkinakis
  • Leif-Jöran Olsson
Abstract

This paper describes and evaluates a generic named-entity recognition (NER) system for Swedish, applied to electronic versions of Swedish literary classics from the 19th century. We discuss the challenges posed by these texts and the adaptations made to the NER system to achieve accurate results, useful both for metadata generation and for enhancing the searching and browsing capabilities of Litteraturbanken, the Swedish Literature Bank, an ongoing cultural heritage project that aims to digitize significant works of Swedish literature.

Related papers

Character Profiling in 19th Century Fiction

This paper describes how personal relationships between main characters in 19th-century Swedish prose fiction can be identified using information guided by named entities, provided by an entity recognition system adapted to the characteristics of 19th-century Swedish. Interpersonal relation extraction is based on the context between two relevant, identified person entities. The re...

Gender-Based Vocation Identification in Swedish 19th Century Prose Fiction using Linguistic Patterns, NER and CRF Learning

This paper investigates how literature could be used as a means to expand our understanding of history. By applying macroanalytic techniques, we aim to investigate how women enter literature, in particular which functions they assume and their working patterns, and whether we can spot differences in how often male and female characters are mentioned with various types of occupational titles (...

A Neural Network-Based System for Recognizing and Classifying Named Entities in Persian Texts

Named Entity Recognition (NER) is a fundamental task in natural language processing and a subtask of information extraction. It seeks to locate named entities in text and classify them into predefined categories such as the names of persons, organizations, and locations, time expressions, etc. Named Entity Recognition for English texts has been researched widely in recent years, howev...

The position of Persian language and literature in Ottoman’s 19th century literature and historical developments

With the spread of Western reforms in the 13th/19th century, Ottoman literature was reformed as well. To reform Ottoman literature, reformers decided to transform the relationship of Ottoman language and literature with Persian language and literature. On one hand, they considered problems of Ottoman literature regarding Pindaric and its inefficiency for entering new areas such as novel, drama, and journa...

Improving Persian Named Entity Recognition Using the Ezafe Marker (Kasre-ye Ezafe)

Named entity recognition is a process in which people's names, place names (cities, countries, seas, etc.), organizations (public and private companies, international institutions, etc.), dates, currencies, and percentages in a text are identified. Named entity recognition plays an important role in many NLP tasks such as semantic role labeling, question answering, summarization, machine ...

Publication date: 2007